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(57) Abstract: A method is disclosed for the in vitro or in vivo cyclization of protein or peptide sequences. Also disclosed is 
a method of fusing polypeptide sequences while bound to a solid support. These protein manipulation techniques relied on the 
/rans-splicing activity of a split intein, such as the naturally occuring split intein form the dnaE gene of Synechocystis sp. PCC6803 
(Ssp DnaE intein). The cyclization procedures required the fusion of C- and N-terminal intein splicing domains to the N- and C-ter- 
mini. respectively, of a target protein (InteiUc-target protein-InteinN). Cyclization in vivo occurred post-translationally when the two 
complementary intein splicing domains ligated the N- and C-terminus of the target protein. In vitro cycUzation also utilized and 
Inteinc-tai^et protein-InteiuN precursor protein, in which the intein domains were fused to a chitin binding domain (CBD). Protein 
expression was conducted under conditions that favored the accumulation of precursor protein, which was immobilized on a chitin 
resin. The circular protein species were eluted from the chitin resin following incubation under conditions that favored protein 
splicing. Trflrts-sphcing was used to ligate polypeptides on a solid support by generating a protein composed of a CBD fused to a 
C-lerminal intein splicing domain and target protein (1 ). This was incubated with a protein composed of target protein (2) fused to 
an N-terminal intein splicing domain and a CBD. The precursor proteins were inunobilized on a chitin resin where /raws-splicing 
resulted in the ligation of target protein (1) to target protein (2). These techniques greatly expand the procedures available for protein 
engineering and modification. 
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METHOD FOR PRODUCING CIRCULAR OR MULTIMERIC PROTEIN 
SPECIES IN VIVO OR IN VITRO AND RELATED METHODS 

BACKGROUND OF THE INVENTION 

There are a number of modifications that proteins can undergo following 
translation. Some of the many post-translational modifications result in proteolytic 
processing or the cbvalent linkage of an important functional group to the protein. 
An interesting post-translational modification is the head-to-tail cyclization of a 
protein or peptide to form a continuous peptide backbone. Many of the naturally 
occurring circular peptides posses anti -bacterial activity, such as the AS-48 peptide 
(Samyn, et ak, FEBS Lett., 352(1) 87-90 (1994)). Also, these antibacterial peptides 
have been found in organisms as divergent as bacteria and primates (Samyn, et al., 
FEBS Lett., 352(1) 87-90 (1994); Tang, et ah, Science, 286(5439) 498-502 (1999)). 
One possibility for forming a cyclic protein species may be that the peptide or 
protein is more conformationally stable once its N- and C-termini have been 
constrained. 

In addition to the naturally occurring cyclic peptides a number of synthetic 
techniques have been developed to generate synthetic circular peptides* 
(Tarn and Lu, Protein Sci.\ 7(7) 1583-1592 (1998); Romanovskis and Spatola, / 
Pept. Res., 52(5) 356-374 (1998); Camarero and Muir, J, Amen Chem. Soc, 121 
5597-5598 (1999); Valero, et al:,7. Pept: Res:. 53(1) 56-67 (1999)). However, due 
to the limitations of total chemical synthesis it is difficult to generate' synthetic 
cyclic peptides larger than 100 aniino acids. This was circumvented using intein 
based technologies that alldwed ribosomally synthesized proteins to cyclize in a 
head-to-tail fashion in vitro (Camarero and Muir, 7. Amer. Ghent Sac, 121 5597-1 
5598 (1999); Evans, et al., / Biol Chem.,274 18359-18363 (1999); Iwai and \ 
Pluckthun, FEBS Lett., 459 166-172 (1999)). However, these procedures did not j 
allow the cyclization of a protein or peptide in vivo for study in a living organism. / 

The in vitro cyclization of ribosdrhally synthesized proteins utilize the 
activity of protein splicing elements (termed inteins Perler, et al:. Nucleic Acids Res., 
22 1 125-1 127 (1994)). Inteins, catalyze their own excision from a primary 
translation product with the concomitant ligation of the flanking protein sequences 
(reviewed in Paulus, Chem. Sac Rev., 27:375-386 (1998), Perler, Cell 92(1)1-4 
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(1998) and Shao and Kent, Chem. Biol. 4(3): 187-194 (1997)). = Inteins catalyze 
three highly coordinated reactions at the N- and C-terminal splice junctions (Xu and 
Perler, EA1B0 7..15(19):5146-5153 (199.6) andChong, el ai., / Biol. Chem., 
271:22159-22168 (1996)): I) an acyl rearrangement at the N-terminal cysteine or 
serine; 2) a transesterification reaction between the two termini to form a branched 
ester.or thioester intermediate; and 3) peptide bond cleavage coupled to cyclization 
of the intein C-temninal asparagine to free the intein. Inteins have been engineered 
to be versatile tools in protein purification (Chong, et al., Gene, 192(2) 271-281 

(1997) , Chong, et al., Nucleic Acids Res. 26(22):5109-5115 (1998), Evans, €t al., 
Protein 5c/., 7:2256-2264 (1998), Mathys, et aU Gene 231:1-13 (1999), Evans, et 
al., / Biol. C/?g/«., 274:3923-3926 (1999), Southworth, et aU Biotechniques, 
27:110-120 (1999) and Wood, et al.. Nature Biotechnology, 17(9):889-892 (1999)), 
protein ligation (Evans, et al.,.Prarein .So. ,,7:2256-2264 (1998), Mathys,- et al.. Gene 
231:1-13 (1999), Evans, etal., /. Biol. Chem., 274:3923-3926^1999), Southworth, 
etal.,fi/orec/i/j/^Mej, 27:110-120 (1999), Cotton, etal., / Ah/. Chem. Soc. 

121:1 100-1 101 (1999), Muir, et al., Proc. Natl. Acad. Sci. USA. 95:6705-6710 

(1998) , Severinov and Muir, J. Biol. Chem. 273:16205-16209 (1998), and Xu, et al., 
Proc. Natl. Acad. Sci. USA 96(2):388-393 (1999)) as well as in the aforementioned 
formation of cyclic proteins and peptides.(E vans,. et al., 7. Biol. Chem. 274:18359- 
18363 (1999), Iwai and Pluckthun, FEBS Lett 459: 166-172 (1999) and Gamarero 
and Muir, J. Amer. Chem. 5oc., 121:5597-5598 (1999)). Limitations of these 
intein technologies include the necessity of generating an N-terminal cysteine 
and/or C-terminal thioester, inteirnediate in vitrp for ligation or cyclization, the need 
to perform extra purification steps tp sejparate ui]ligated reactants from the ligation 
products and the requirement of a deriatijrant to permit in vitro /ra/j^-splicing 
reactions (Yamazaki, et al., J. Am.. Chem. Soc. 120:5591-5592 (1998), Mills, et al., 
Proc. Natl. Acad. Sci. USA, 95(7):3543-3548 (1998), and Southworth, et al., EMBO 
7., 17(4):9 18-926 (1998)). , .. 

In addition to the c/i-splicing inteins and those engineered to rra/i^-splice 
(Yamazaki, et al., J. Am. Chem. Soc. 120:5591-5592 (1998), Wu, et al., Biochim. 
Biophys. Acta, 1387:422-432 (1998), Mills, et al., Proc. Natl. Acad Sci. USA, 
95(7):3543-3548 (1998), Otomo, et al., /. Biomol. /VM/?, 14(2): 105-1 14, Oiomo, et 
al.. Biochemistry, 39(49): 16040: 16044, and Southworth, et al., EMBO J., 
17(4):91 8-926 (1998)), a naturally-occurring split intein was recently identified in 
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the dnaE gent encoding the catalytic subunit of DNA polymerase III of 
Synechocystis sp. PCC6803.(Wu, et al., Proc. Natl, Acad. Sa\ USA, 95(16):9226- 
9231 (1998)). The N-terminal half of DnaE, followed by a 123-amino acid intein 
sequence, and the C-terminal half, preceded by a 36-amino acid intein sequence, are 
encoded by two open reading frames located more than 745 kilobases apart in the 
genome. When co-expressed in E, coli, the two DnaE-intein fragments exhibited 
protein rra725"-splicing (Wu, et al., Proc. I^ati Acad. Sci. USA, 95(16):9226-9231 
(1998)), 

Accordingly, it would be desirable to utilize intein technology in developing 
methods for producing circular or multimeric protein species in vivo or in vitro. 
Such methods would permit the formation of cyclic polypeptides in new hosts, 
facilitate the separation of products from reactants when ligating proteins for 
isotopic labeling, and allow the generation of cyclic polypeptides that are sensitive 
to reducing agents. 

SUMMARY OF THE INVENTION 

The abbreviations used herein are: 

"Ssp DnaE intein" means a naturally split intein from the dfiaE gene of 
Synechocystis sp.PCC6803\. ^ , , 

"DnaE(N)" means the N-terminal 123 amino acid residues of the Ssp 
DnaE intein; 

"DnaE(C)'' means the^G-terminal 36 amino.acid residues of the Ssp DnaE 

intein; 

"MBP" means maltose binding protein; 
"CBD" me;ans chitin binding^ domain; 
"Fx a" means factor Xa; 
/TTS" means intramolecular /ran^-splicing. 

In accordance with one embodiment of the present invention, there is 
provided a method for producing a circular or multimeric protein species in vivo or 
in vitro. The steps comprising the in vivo cyclization or multimerization reaction 
consists of fusing the C-terminal splicing domain of a protein splicing element (an 
intein) to the N-terminus of the target protein and the N-terminal splicing domain of 


wo 01/57183 


PCT/USOl/03147 


-4- 

an intein to the C-terminus of the same target protein and expressing the fusion 
protein in the desired organism at the temperature permissive for intein splicing. 
Cyclization occurs when the two splicing domains from the same target protein 
interact and spHce whereas multimerization occurs if the two splicing domains from 
two different target proteins interact and splice. 

The intein splicing domains are also deferred to herein as intein fragments. 
The intein fragments are chosen so that they represent complementary trans- 
splicing domains. These complementary intein fragments could be choser^from the 
known rra/25-splicing inteins (Yamazaki, et al., J, Am, Chem. Soc. 120:5591-5592 
(1998), Wu, et al., Biochim. Biophys. Acta, 1387:422-432 (1998), Mills, et al., Proc. 
Natl. Acad. ScL USA, 95(7):3543-3548 (1998), Wu, et al., Proc NatL Acad. ScL 
USA, 95(16):9226-923r (1998), Otomo, et al., 7: BiomoL NMR, 14(2):105-114, 
Otomo, et al., Biochemistry, 39(49): 16040- 16044, and Southworth, et al., EMBOJ,, 
I7(4):918-926 (1998)). The intein fragments used in the present study were the N- 
terminal 123 amino acids and the 36 C-terminal amino acids of the of the Ssp 
DnaE intein, respectively. 

This intein based technology allows naturally occurring circular proteins to 
be expressed in organisms that are not the native host. Furthermore, this 
technology permits a wide range of circular proteins, including those not found in 
nature to be expressed in such organisms. 

In vitro cyclization also involves fusing the C-terminal splicing domain of 
an intein to the N-terminus of the target protein and the N-terminal splicing domain 
of an intein to the C-terminus of the saiiie target protein as described above. 
However, expression of the fusion protein in the desired organism is carried out 
under conditions that are not permissive for intein splicing. Also, one or both intein 
splicing domains may carry an affinity tag that allows immobilization on an affinity 
resin. 

The present invention is exemplified by, though not limited to, the intein 
found in the dnaE gene of Synechocystissp. PCC6803 (Ssp DnaE intein). The 
first step in the in vitro cyclization reaction is the generation of the full length 
precursor protein, either by ribosomal synthesis or by total chemical synthesis. The 
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full length precursor protein is then immobilized on a solid support, such as a chitin 
resin. Protein cyclization occurs when the resin containing the bound protein is 
equilibrated at the appropriate temperature and pH to allow splicing to proceed. As 
described above for the in vzvo case, cyclization occurs if the two splicing domains 
from the same target protein interact and splice whereas multimerization occurs if 
the two splicing domains from two different target proteins interact and splice. 
Following the /ra/z^-splicing reaction to generate the cyclic protein species the final 
products were eluted from the chitin resin. This method is unlike previous intein 
based in vitro cyclization techniques, referenced above, and may be used to^ 
circularize proteins that are sensitive to the reducing agents used in the other 
procedures or aie not amenable to use with other inteins. 

. The present. invention also describes a method for ligating two protein 
fragments on-column and separating away the reactants by elution from the affinity 
resin. The steps involved comprise fusing target protein 1 to a C-terminal intein 
splicing domain while target protein 2 is fused to an N-terminal intein splicing 
domain. 

An affinity domain can be fused to both or either of the N- and C-terminal 
intern splicing domains so that the N- and C-tenninal protein fusion molecules can 
be immobilized on an affinity resin. The affinity domain exemplified herein is the 
chitin binding domain from B. c/raJa/75XWatanabe, et al., 7. Bacteriol, 176:4465- 
4472 (1994) ). Following the generation of the precursor proteins in the present 
invention they are applied to a chitin, resin. The immobilized proteins are ligated 
together when favorable conditions, exist .to permit the cornplementar>' intein 
fragments from separate molecules to undergo the rra/2^^-splicing. reaction. 

Following rra72^-splicing the ligated protein products were no longer fused 
to the intein fragments or the affinity domain and so these products were isolated 
by eluting it from the chitin resin. In contrast, the unused reactants remained 
bound. This permits the localization of the ligation reaction and overcomes many 
of the disadvantages and problems of the previous technologies noted above. 
Specifically, the present invention allows the facile separation of the ligated protein 
species from the unused reactants and the use of inteins such as the Ssp DnaE 
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iniein eliminates the need for the denaturani treatment step to pemit the trans- 
splicing reaction. - ' ' ' 

■■ ■ ' The present invention also relates to a method for controlling cleavage at the 
intein splice junctions. In this embodiment, a protein is fiised to the N-terminal 
intein splicing domain. This fusion protein may or may not be purified. Cleavage 
of the fusion protein is induced by the addition of the complementary C-terminal 
splicing domain, that may or may not contain a mutation to blodk any potential 
splicing activity. Alternatively,- a protein is fused to the G-terminal intein splicing 
domain. This fusion may or may not be purified. Cleavage is induced by the 
addition of the N-terminal intein splicing domaiin. In both of the above instances, 
the cleavage reaction may be accelerated using reagents or conditions that increase 
the rate of cleavage, such as thiol reagents, pH, or temperature. This mechanism of 
controllable cleavage has beisn termed /ran5-cleavage. ■ 

BRIEF DESCRIPTION OF THE DRA WINGS 

Figure I depicts the Ssp DnaE intein cis- and rra/i^-splicing constructs. - 
The c/^-sphcing constructs, pKlEBS, pMEB8-N2; pMEB8-CI, -C2, and -C3 all use 
maltose binding protein (MBP) and thle chifiri binding domain (CBD) as the N- and 
C-exteins, respectively. The differences are in the extein residues adjacent to the - 
intein and are represented by their single letter code for ease of c The i 

constructs used in the two plasmid, fm/75-splicing^ system were pMEB4 and 
pKEBI which contain theN- and'C-termihal DnaE intein fragments,'DnaE(N) 
and DnaE(C), respectively. The intramolecular rrmz^'-splicing construct, pMEB21, 
placed DnaE(C) and DnaE(N) at the and C-terminus of MBP, respectively. 

Figure 2 is a gel depicting the //V Vjvd splicing of the S^p DnaE intein b 
SDS-PAGE. : 


E (18 kDa) and the cleavage products M (43 kDa) andEB (25 kDa) are visible. 

Lane 3, cioide cell extract after a 2 hour induction at 37°C, Lane 4. Crude cell 

extract after a 2 hour induction at 37°C followed by overnight incubation at 15°C. 

Figure 2B, The cw-sp)icing activity of the Ssp DnaE intein in vivo with 
mutated exiein residues . Xane 1, in vivo splicing activity with 2 native N-extein and 
, 3,native C-extein residues (pMEB-N2). The splicing of the Ssp DnaE intein with 5 
native N-extein residues and 1 (pMEB8-Cl, lane. 2), 2 (pMEB8-C2, lane 3) or 3 
. native C-extein residues (pMEB8-C3, lane 4). 

Figure 2C, The 5^ DnaE jntein in vivo rrajw-splicing activity investigated 
by co-expression of the MBP-DnaE(N) (57 kD^) and the DnaE(C)-CBD (10 kDa) 
fusion proteins. Lane 1, uninduced crude cell extract. Crude cell extract after 

induction of protein expression ati5°C (Lane 2), at 3fC (Lane 3) and at 37°C 

followed by incubation with shaking at 15°C (Lane 4) all displayed precursor 
(MEB), spliced product (MB) and cleavage product (M). Lane 5, induction of 
protein expression with ME(N), but not E(C)B, displayed no detectable splicing or 
cleavage. All samples were analyzed by Coomassie Blue stained 12% SDS-PAGE 
gels. 

Figure 3 is a gel diepicting protein rra/i^-splicirig and cyclization reactions 
using the 5j7? DnaE intein. ■ . 

Figure 3A, Inteimolecular rra/jiy-spiicing (ITS). The association of the N- 
terminal and C-lerminal Ssp DnaE intein fragments, DnaE(N) and DnaE(C), 
respectively, aligns the two splice junctions for the fusion of the N-and C-extein 
sequences. The splicing reaction, presumably occurs via the same splicing pathway 
as the c/^-splicing pathway proposed previously (Xu and Perler, EMBO J., 
15(19),:5146-5153 (1996) and Chong, et al., J. Biol. Chem. 27 L 22 159-22 168 
(1996)). Cleavage at the N-temiinal splice junction can occur by hydrolysis or 
nucleophilic attack of the thioester bond formed at the C-terminus of the N-extein. 

Figure 3B, Intramolecular rran^-splicing (ITS). A target protein is 
sandwiched between the intein C-terminal segment (36 amino acids) and the intein 
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N-terminal segment (123 amino acids). Splicing joins the N-terminus of the target 
protein to its own C-teiminus through a peptide bond. The presience of a chitin 
binding domain (CBD) fused to the C-terminus of the intein N-termina! segment 
: facilitated purification of the precursor protein and the subsequent./// vitro 
cyclizati on reaction on chitin resin. 

Figure 4 is a gel depicting the in vitro /ra/?5-splicing of the Ssp DnaE intein 
as well as the rrar^-cleavage reaction. 

Figure 4A, 10-20% SDS-PAGE gel of peptide induced splicing and 
cleavage of MBP^DnaE(N)-CBD (ME(N)B). Lane 1 , arhylose purified ME(N)B 
(64 kDa). Lane 2, ME(N)B, 1 mg/mL, following overnight incubation with the 
splicing peptide' (Splice-pep, 500 :M). The cleavage product (M) and spliced 
product (M-CFNK*) co-migrate at 43 kDa. Lane 3, ME(N)B, 1 mg/mL, after 
overnight incubation with the cleavage peptide (Cleav-pep, 500 :M). Both Splice- 
pep and Cleav-pep are based on the C-tenninal fragment of ihc Ssp DnaE intein as 
described in the Example III and Example IV. 

Figure 4B,rra/i5-splicing of .MBP-DnaE(N)-CBD(ME(N)B) and CBD ^, 
DnaE(C)-T4 iigase (BE(C)L) examined by 12% SDS-PAGE. Lane 1, crude cell 
extract after induction of ME(N)B expression. Lane 2, ME(N)B following 
purification over an amylose column. Lane 3, ME(N)B that was bound to chitin 
beads and eluted with SDS. The chitin binding domain (CBD) permits binding to a 
chitin resin. Lane 4, crude cell extract after induction of BE(C)L expression. Lane; 
5, chitin bound BE(C)L that was removed by treatment with SDS. Lane 6, 

incubation of amylose-purified ME(N)B and chitin bound BE(C)L at 4°C for 16 
hours followed by elution of the chitin resin with SDS. The fusion proteins 
ME(N)B and BE(C)L were bound to separate batches of chitin beads and the chitin 
bound proteins were mixed followed by, elution of the beads with SDS after 

incubation at 4°C (lane 7), le^'C (lane 8) or 37''C (Ikne 9) for 16 hours. Lane 10, 
the supernatant from the chitin bead mixture described in laiie 8. Note that the 
spliced product (ML) is free in solution while the reactants remain bound to the 
chitin beads. Lane 11, Factor Xa (Fxa) treatnient (1:100 FXa:ML) of the 
supernatant fraction. M, MBP (43 kDa). L, T4 DNA ligase (58 kDa). 


rr , . . , . I , 

Figure SA, in w'vo protein cyclizalion. Lane 1 , uninduced crude cell extract. 
Lane 2, crude cell extract following induction at 15"C contains the precursor 
DnaE(C)-MBP-DnaE(N)-CBD (E(C)ME(N)B, 65 icDa), cyclic MBP (c-MBP, 47 
kDa), linear MBP (1-MB.P, 43 kDa), the DnaE(C)-MBP (E(C)M, 45 kDa), and 
DnaE(N)-CBD (E(N)B,,23 kDa): Lane 3, clarified cell extract from Lane 2 
following passage over an amylose column. Note that the cyclic maltose binding 
protein (MBP) binds to amylose. Lane 4, proteins eluted from amylose resin. Lane 
5. the eluted sainnle inciihared with Fartnr Ya H • 1 00 Fyj)'\/rRP\ PYo trpatmpnt 

also resulted in the release of.a 45 kDa species corresponding to E(C)M- 

: Figure 5B, in vitro protein circularization. Lane 1, unindiiced cell extract. 
Lane 2, crude cell extract following induction at 37°G.;, Lane 3^ clarified cell extract 
following passage over a chitin column. Lane 4, proteins eluted from the chitin 
column following incubation at 23°C for 16 hours., Lane 5, incubation of the chitin 
eluted sample with FactorXa (LlOO FXaiMBP). All reactions were performed as 
described in Example V and were analyzed on a 12% SDS-PAGE gel. 

DETAILED DESCRIPTION OF THE INVENTION 

The cyciization and.ligation methods of the present invention are based on 
the discovery that splitJnteins, are capable of fra/i^-splicing either in vitro or in vivo 
(U.S. Patent D^o. 5,834,247; Mills, et al., Prpc, Natl, Acad, Sd USK95{1) 3543- 
3548 (1998); Otpmo,etal.,y. BiomoimR, 14(2) 105-114 (1999); Shingledecker, 
et al.. Gene, 207(2) 187-195.(1998); Southworth, etal., EMBO J., 17(4) 918-926 
(1998); Yamazaki, et al., J. Am. Chem. 5oc., 120 5591-5592 (1998)) ■ 

The ligation .procedure disclosed herein utilizes a split.protein splicing 
element, an intein (Perier, et al.. Nucleic Acids Res., 22 1125-1127 (1994)) to join 
the N- and C-termini.of the same or separate protein species. Previously, the 
ligation of the N- and C-termini of sepai-ate protein sequences was described using 
an intein (CIVPS) that was artificially split (U.S. Patent No. 5,834,247; Mills, et al., 
Proc. Natl. Acad Sci. USA, 95(7) 3543-3548 (1998); Otomo, et al., J. Biomol. 
NMR, 14(2) 105-1 14 (1999); Shingledecker, et al.. Gene, 207(2) 187-195 (1998); 
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• South worth, et al., EMBO J.. 17(4) 918-926 (1998); Yamazaki, et aL, / Avh Chem. 
Soc, 120 5591-5592 (1998).' However/the present invention describes how a split 
intein can be used to fuse two protein segments on a soHd support'. The present 
' invention rehes on the generation of precursor proteins composed of 
complementary intein fragments and the desired larget'proteins. 

The first step in the ligation of proteins on a solid support is the generation 
of the necessary precursor proteins comprising the intein fragment fused to the 
target protein at the genetic level followed by expression of the gene. For example, 
in order to create ia fusion protein of a C-terminal intein fragment fused to the N- 
terminus of a first target protein (target protein 1), the gene encoding* the C-terminal 
intein fragment is cloned in frame to the gene encoding the target protein 1 . The 
genes are arranged so that the C-terminus of the G-teirninal intein fragment is fused 
to the N-terrninus of target protein 1. - ■ ' 

Alteniatively, the precursor protein comprising an N-terminal intein 
fragment fused to a second target protein (target protein 2) is created by cloning the v 
gene encoding the N-terminal intein fragment in frame to the gene encoding target 
protein 2. Specifically, the genes are arranged so that the C-terminus of target 
protein 2 is fused in-frame with the N-terminiis of the N-terminal intein fragment. ■ 
In either case the new gene fusion is placed into a context that permits it's 
transcription iand translation to result in the production of the desired fusion 4 
protein. For example, the fusion genes coiild 'be cloned into an E: coti expression 
•vector such as that described previously for intein purification vectors (Chong, et 
al./Gene 192(2):271-281 (1997); (Evans, et al., Protein 5cz., 7:2256-2264 (1998)), 
but could be any expression vector system. In addition to the generation of 
ribosomally synthesized precursor proteins as described above, the same fusion 
proteins could be chemically synthesized using standard procedures (reviewed in 
: Kent, S. B. ;H., Annu. Rev. Biochenz, 57:957-989 (1988), 

The intein fragments are chosen to be complementary rra/i^-isplicing 
domains. For example, the complementary intein fragments could be chosen from 
the known rra;z5-splicing inteins, such as the Ssp DnaE intein, Ssp DnaB intein. 
Mill RecA intein, Psp Pol-Tintein, Pl-pful intein and PI'/t/wH intein (Yamazaki, et 
al., y. Am, Chem. Soc. 120:5591-5592 (1998), Wu, et al., Biochim. Biophys. Acta, 
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1387:422-432 (1998), Mills, et al., Proc.Nati Acad ScL USA, 95(7):3543-3548 
(1998), Wu, et ah, Proc. Natl. Acad Sci. USA. 95(16):9226-9231 (1998), Otomo, et 
al.,y. Biomol.NMR, 14(2); 105-114, Otomo, et Biochemist n\ 39(49):16040- 
1.6044, and Southworth, el <iU:EMBO /, 17(4):918-926 (1998)): The intein 
fragments used in the present study were the N-terminal 123 amino acids and the 
36 C-terminal amino acids of. the of the Ssp DnaE intein, respectively. 

An affinity domain can be fused to both or either of the N- and C-terminal 
intein splicmg domains so that the N- and C-terminal protein fusion molecules can 
be in^mobilized on a solid support such as an affinity resin. The affinity domain 
■ exemplified herein is the chitin binding domain from B, circulans (Watanabe, et al., 
y. Bacteriol, 176:4465-4472 (1994)).- Following the generation of the precursor 
proteins they are applied to a solid suppon where they are immobilized. The nature 
of the solid support.>vill depend on the goals of the experiment, but may be an 
affinity column. The solid support in the present disclosure is a chitin resin, but 
could be any suppon such as microtiter plates, beads, on materials used in biochips, 
such as glass wafers. . . • . 

The immobilized proteins are ligated together when favorable conditions 
exist to penmit the complementary intein fragments from separate molecules to 
undergo the rwz5*-splicing reaction. Favorable conditions include the proper salt 
concentration, pH, temperature, or the presence of a molecule, such as a reducing 
agent, that facilitates the splicing reaction. The conditions that favor rra/?^^-splicing 
can be elucidated for the intein fragments being used by directly testing its 
perfonriance in vitro m a. variety of conditions. The design of these experiments 
can be as described previously for rranj-splicing inteins (Yamazaki, et al.^ 7. Am. 
Chem. Soc, 120:5591-5592 (1998), Wu, et al., Biochim. Biophys. Acta, 1387:422- 
432 (1998), Mills, et al., Proc. Natl. Acad Sci. USA, 95(7):3543-3548 (1998), Wu, 
et al., Proc. Natl Acad Sci: USA, 95C16):9226-9231 (1998), Otomo, et al., /. 
Biomol NMR, 14(2): 105-1 14, Otoma, et ^l. Biochemistry, 39(49); 16040- 16044, 
and Southworth, et al., EMBO J., 17(4):918-926 (1998)). For example, in the 
present disclosure the 5^/? DnaE intein was found to rran^-splice less efficiently at 
37°C than at 15''C and was less active at pH >10 than at a neutral pH. 
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Following /rfl/?.v-splicing ihe ligatcd protein products are no longer fused to 
the intein fragments or the affinity domain and so these products can be isolated by 
- elating it from the solid support. In contrast, the unused reactants remain bound. 
This permits the localization of the ligation reaction and overcomes many of the 
. disadvantages and problems of the previous technologies noted above. Specifically, 
the present invention allows the facile separation of the ligated protein species from 
the unused reactants and the use of the Ssp DnaE intein eliminates the need for the 
denaturant treatment step to permit the trcins-sphcing reaction. 

' * - ■ ■ --A. 

The in v/vo cyclization reaction of the instant invention begins with the in- 
frame fusion of the gene encoding theC-terminal splicing domain of an intein to 
the gene encoding the target protein and the in-frame*fusion of the gene encoding 
the N-terininal splicing domain of an intein to the gene encoding the same target 
protein. The gene is arranged so that following translation the C-terminus of the C- 
terminal splicing domain is fused to the Nrtemlinus of the target protein and the N-j^v 
terminus of .the N^terminal splicing domain is fused to the C-terminus of the target c, 
protein. This precursor protein is represented as Intein^-target-Inteinf;,. The intein 
used in the present disclosure is the Ssp DnaE intein, but could be any 
complementary intein fragments as described above. 

The Intein^^-target-Inteinj^, fusion gene is placed into a context that permits v 
its transcription and translation to result in the production of the desired precursor 
protein. For example, the gene could be cloned into an E, coli expression vector 
such as those:described previously (Ghong,.et al., Gene 192(2):271-281 (1997); 
Evans, et al., Protein Sci., 7:2256^2264 (1998). The organisms intracellular 
environment and the organisms growth conditions should be favorable for trans- 
spHcing. The conditions that favor trans-spWcmg can be elucidated for the intein 
fragments being used by directly testing its rm/7^-sp!icing activity in the desired 
organism or by testing its performance in vitro in a variety of conditions.- The 
* design of these experiments can be as described previously for rra/z^-spilicing 
inteins (Yamazaki, et al., 7. A/7z. Chem. Soc. 120:5591-5592 (1998), Wu, et al., 
Biochiuh Biophys. Acta, 1387:422-432 (1998), Mills, et al., Proc. Natl Acad. Sci, 
USA, 95(7):3543-3548 (1998), Wu, et al., Proc, Natl Acad, Sci. f/5A, 95(16):9226- 
9231 (1998), Otomo, et al., J, BiomoL NMR, 14(2): 105-1 14, Otomo, et al, 
Biochemistiy, 39(49): 16040-16044, and Southworth, et al., EMBO 7., 17(4):918- 


to trans-spWcc less efficiently at 3TC than at 15°C temperatures. In this case, the 
E. coli cells expressing the Intein^-target-Inteirif^, precursor should be grown at 
IS^'C to facilitate frOT^-splicing and thereby the cyclization reaction. 

. The cyclization reaction itself occurs when the rmn^-splicing of the inteins 
in the Intein^-target-Intein^ precursor generates a peptide bond between the N- and 
C-lenninus of the target protein. This allows circular proteins to be produced in 
organisms that may-not normally be able to do so. 

The in vivo mulLimerizatioa reaction of the present invention also begins 
with an Intein(.-target-Inteinj^, gene fusion. However, following expression of the 
fusion protein the two complementary intein splicing domains from separate 
Intein^^-target-Inteinj^ precursor proteins initiate the splicing reaction to ligate the 
two or more target proteins together. 

The ratio of the intramolecular reaction, cyclization, to the intermolecular 
reaction, mullimerization, may be controlled by engineering the target protein. For 
example if the N- and C- terminus of the target protein cannot come into close 
proximity, then the cyclization reaction will not be favored and multimerization 
should.predominate. However, if the N- and C-terminus of the target protein are 
spatially close to one another, then the cyclization reaction should predominate. 
Jhe determination of the proximity of the and G-terminus of the target protein 
can be estimated based on X-ray or NMR structural data.- However, the exact 
determination of whether the N- and C^terminus of a target . protein can be brought 
together and ligated by rran5'Splicing will need to be determined experimentally. 
Should it be discovered that the target protein does not cyclize, then an extra amino 
acid linker can be added to facilitate the cyclization reaction. The number of anoino 
acids needed in the linker to. ligate the ends of the target protein may be determined 
by adding a linker of increasing length until cyclization occurs, as determined 
experimentally. Alternatively, if X-ray or NMR structural data is available, then a 
starting point for the length of the linker would be to determine the distance 
between the ends of the target protein: This distance would be convened into a trial 
length for the linker by estimating the number of amino acids needed to span that 
distance. However, the final determination ifthe linker is of the proper number of 
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amino acids is by experimcnla! verification of the production of the desired cyclic 
. . protein. . - • . • • 

The //) vitro cyclization reaction of the present invention begins' with the 
Intein^^-target-Inteinj^ gene fusion arranged as described above. The gene is placed 
into a context that pemnits its expression to generate the precursor protein as 
described above. In this example, however, the intracellular environment of the host 
organism or its growth conditions should not favor the rra/25-splicing reaction. 
This is to prevent the cyclization reaction' due to /rara-splicing from occurring in 
vivo. In the present description, the Ssp DnaE intein did not splice proficiently at 
3TC and so induction of protein expression was performed at 37°C. Conditions 
which are not favorable for intein splicing can be determined for the intein being 
used by performing experiments as described above for the determination of 
' . favorable splicing conditions: In addition to the generation of ribosom'ally 
synthesized precursor proteins, the same fiision protein could be chemically 
synthesized using standard procedures (reviewed in Kent, S. B. H., Annu. Rev. 
Biochem.,.57:957-989 (1988)). • . 

The full length fusion protein is then purified on a solid support using an 
affinity tag. The solid support used in the present disclosure is a chitin resin, but 
: could be any solid support such as microtiier plates ■ beads, or rhaterials used in 
• biochips such as glass wafers. Furthermore, while the affmity tag used in the ^'^^ 
. : present invention was ai chitin binding domain (Watanabe, et al., J. Bacteriol, 176 
.4465-4472 (L994)), any affinity tag such as maltose binding protein, His tag, Flag 
tag or the cellulose binding domain could be used. The affinity tag can be present 
on none, both, or one of the intein splicing domains. 

Following inimobilization of the full length fusion protein on an affinity 
I resin any unbound protein can be washed away- Cyclic protein is generated by 
equilibrating the solid support at conditions that favor /ran^-splicing of the intein. 
In the present invention the solid support, a chitin resin, with the immobihzed 
Intein^-target-Intein^ precursor protein was allowed to equilibrate to room 
temperature, which is a temperature that'was determined experimentally to be 
favorable for /ra/z^-splicing of the 55p DnaE intein. The optimal conditions for the 
intein. being used can be determined experimentally as described above.' Trans- 
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splicing of the intein fragments results in a cyclic protein vyhich can be eluted from 
the solid support-. 

Furthermore, as described for the in vivo case, it is possible to generate 
multimers consisting of repeating units of the target protein. As described above 
these are generated by an intermolecular reaction of the Intein^^-target-lntein^. 
precursor proteins. The conditions necessary are the same as for in vitro 
cyclization with the exception that the intermolecular reaction, should be optimized 
over the intramolecular reaction. This needs to be determined experimentalty, but 
can be carried out as described for the in v/w cyclization reaction. , 

The intein based rra/2^-cleavage method of the present invention begins 
with the fusion of a target. protein to the N-terminal intein splicing domain. 
Specifically the C-terminus of the target protein is fused to the N-terminus of the 
intein splicing domain. This Intein ^.-target protein fusion may or may not be 
purified. Because only a portion of the intein is present no unwanted intein 
mediated cleavage occurs in vivo or in vitro. Also, the precursor can be stored in 
the presence of reagents that induce cleavage in other intein fusions or in conditions 
that promote full length intein cleavage. Cleavage is initiated by adding the C- 
terminal intein splicing domain. This may be synthetic or ribosomally synthesized. 
In the present disclosure both a native and a modified C-terminal intein domain 
were used to induce cleavage of the peptide bond at the N-terminus of the 
complementary intein fragment,* the N^terminal ineein splicing domain. Mutations 
that may be made to block intein splicing activity, but which can still allow cleavage 
of ceitain peptide bonds adjacent to the iritein have been described previously 
(Chong, et al., Ge«^, 192(2):271-281 (1997),-Chong, et al., Nucleic AcidsMes. 
26(22):5I09-51 15 (1998), Evans,. et aL, Protein 5d., 7:2256-2264 (1998), Mathys, 
et aL, Gene 231:1-13 (1999), Evans, et al., /. BioL Chem,, 274:3923-3926 (1999), 
Southwprth, et aL, Biotechniques, 27:1 10:120 (1999) and Wood, et £lU Nature 
Biotechnology, n{9)'M9-892 (1999) and reviewed in Evans, et a!., Biopolymers, 
5,l(5):333-342 (1999) and Noren.et dl.,Ange\yandte Chemie Int,.Ed., 39:450-466 
(2000)). Alternatively, the target protein could be fused to the C-terminal intein 
splicing domain. Specifically, the N-tenninus of the target protein is fused to the 
C-temninus of the intein splicing domain.. Cleavage is induced by adding the N- 
terminal splicing dom.ain with or without a mutation to block splicing activity. In 
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either of the above cases, the precursor protein may or may not be purified. 
Purification would be simplified using an affinity domain attached to the intein tag. 
The precursor would be immobilized on the appropriate affinity column and 
unbound proteins could be washed away. Cleavage would be initiated by adding 
the appropriate intein splicing domain. The rra/i^-cleavage reaction could be 
accelerated using reagents or conditions known to induce or increase the rate of 
cleavage in other inteins, for example thiol reagents, pH, or temperature (Chong, et 
dl.. Gene, 192(2):27 1-281 (1997), ChongMal, Nucleic Acids Res. 26(22):5109- 
5115 (1998), Evans, et al.. Protein 5c/.,' 7:2256-2264 (1998), Mathys, et SL^Gene 
231:1-13 (1999), Evans, et al., 1 Biol Chem,, 274:3923-3926 (1999), Sbuthworth, 
et aL, Biotechniques, 27:1 10-120 (1999) and Wood, et al, Nature Biotechnology, 
17(9):889-892 (1999) and reviewed in Evans, et al., Biopolymers-5l{5y3?>?>-'iA2 
(1999) and Noren, et TiX.lAngewandte Chemieint, 39:450-466 (2000)). 

The present invention is further illustrated by the following Examples. .^'^ 
These Examples are provided to aid in the understanding of the invention and are -^^ 
not construed as a limitation thereof. - 

The references cited above and below are herein incorporated by reference. 

EXAMPLE I 

Creation of vectors pMEBl, pMiEB2, pMEB3 for m-splicing studies: 

Construction of Plasniids^pMEB 1 was constructed by replacing the See 
VMA intein in pMYB129 (Chori^g; et al., Gene 192(2):27 1-281 (1997)) with the 
' Ssp DnaE intein sequence spanning residues 5-123 to create a fusion gene 
composed of E. co/rmaltose-binding protein (MBP) (Duplay, et al., J. Biol. Ghent 
259: 10606-10613 (1984)), the Ssp DnaE intein (residues 5-123) and the Bacillus 
circulans chitin binding domain (CBD)l(Watanabe, et al., / fiac?eno/.' 176:4465- 
4472 (1994)). The Ssp DnaE intein fragment was amplified from plasmid pDnaE- 
C209 with primers 5'-TTTGGTACCGAAATTTTAACCGTTGAG-3' (SEQ ID 
NO: 1) and 5'-GGCTCTTCCTTTAATTGTCCCAGCGTCAAG-3' (SEQ ID 
NO:2). The N-tenninal splice junction sequence, containing the flanking 5 native 
N-extein residues and the 5 intein N-terminal residues, was inserted between 
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maltose binding protein (MBP) and the intein coding regions by linker insertion 
into ihe Xhol and Kpnl sites in pMEBl to create pMEB2. The linker-was formed 
by annealing oligonucleotides, 5'-TCGAGAAATTTGCTGAATATTGCCTGTCT 
TTTGGTAC -3' (SEQ ID N0:3) and 5'-CAAAAGACAGGCAATATTCAGCAA 
ATTTC-3' (SEQ ID N0:4). 

The DNA sequence encoding the C-terminal 36 amino acid residues and the 
first 3 C-extein residues (5'-ATGGTTAAAGTTATCGGTCGTAGATCTCTGGG 
CGTGGAGqGCATCTTTGATATCGGTCTGCCGCAGGACCATAACTTTCTG 
CTAGCCAACGGCGCTATCGCTGCTAACTGCTTTAACAAATCC-3'(SEQ 
ID NO:5)) was inserted into pMEB2 to create pMEB3 which expresses a fusion 
protein (MEB) composed of maltose binding protein (MBP), the full length Ssp 
DnaE intein (residues 1-159) with .5 native extein residues at its N-terminus and 3 
native residues at its C-terminus, and the chitin binding domain (CBD). 

Creation of vectors pMEB4, pKEBl, pBELll, and pMEB8 for trans- 
splicing studies: 

A translation termination codon was introduced into pMEB2 following the 

. codon for Lys^"^ of the Ssp DnaE intein by insertion of a linker formed by 
annealing oligonucleotides 5'-AAATAAGGAGGTTAATAAAAGGAAGA 
GCCATGGCGCGCCTTAATTAAA-3' (SEQ ID N0:6) and 5'-CCGGT 
TTAATTAAGGCGCGCCATGGCrCTTCCTTTTATTAACCTCCTTA-^ (SEQ 
ID NO:7). The resulting plasmid, pMEB4, expresses a fusion protein composed of 
maltose binding protein (MBP) and the N-terminal 123 residues of the Ssp DnaE 
intein [DnaE(N)]. pKEBl contains the kanamycin resistance gene and the pl5a 
origin of replication from pACYC177 (Chang and Cohen, J. Bacteriol 134:1141- 
1 156 (1978)). It also expresses a fusion protein composed of the 36 C-terminal 
amino acids of the Ssp DnaE intein [DnaE(C)] followed by 3 native extein residues 
and the chitin binding domain (CBD). pBELl 1 expresses a CBD-DnaE(C)-T4 
DNA ligase fusion protein in the pBSL-C155 vector (Mathys, et al., Gene 231:1-13 
(1999)). 

pMEB8 was generated by transferring the 0.6 kb Xhol to Pstl fragment of 
pMEB3 into pMYB5 (New England Biolabs, Inc., Beverly, MA). Mutation of the 
extein residues in pMEBS was performed by linker substitutions using the X/ioI 
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and Kpnl sites flanking the N-terminal splice junction or the Mi^^l and-Ag^l sites 
flanking the C-terminal splice junction. pMEB8^N2 retains 2 native N-extein 
residues while pMEB8-Cl. C2, or C3 possess 1, 2 or 3 native C-extein residues 
(Figure 1). ■ = 

Creation of vector pMEB21 for in vivo and in vitro cyclization studies: 

The protein cyclization vector, pMEB21, expresses a fusion protein with the 
DnaE(G) immediately followed by amino acid residues CFNISTG (SEQ S3 
NO:8), maltose binding protein (MBP), which terminates with amino acid residues 
GTLEKFAEY (SEQ ID N0:9), and then DnaE(N)-CBD, 

EXAMPLEII ^ 

The cw-splicing activity of the Ssp DnaE intein in vivo: 

The in vivo splicing activity of the full length Ssp DnaE intein was 
investigated by analyzing protein expression fvom E. coli ER2566 cells (Chong, et 
al.. Gene 192(2):271-281 (1997)) bearing plasmid pMEB8. The cells bearing the 
plasmid were grown in LB medium containing the appropriate antibiotic selection at 
37°C with shaking to an OD^^ of 0.5. Protein expression was induced by the 
addition of 0 3 mM Isopropyl fi-D-Thiogalactopyranoside (IPTG) at either 15°C 
for 16 h or at 37°C for 2 h. Crude cell extracts were visualized by electrophoresis 
on a 12% Tris-Glycine gel (Novex, San Diego, CA) followed by staining with 
Coomassie Brilliant Blue. 

EXAMPLE III 

The ligation of two protein in vivo using /ra/i5-splicing: 

The in vivo /m/25-splicing activity of the Ssp DnaE intein was demonstrated 
by analyzing the proteins from E. coli strain ER2566 (Chong, et al., Gene 
192(2):271-281 (1997)) bearing the two compatible plasmids, pMEB4 and pKEBl. 
The E. coli bearing the plasmids were grown in LB medium containing the 
appropriate antibiotics (50 /xg/mL kanamycin and 50 /ig/mL ampicillin) at 37**C 
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with shaking until an OD^^j^ of 0.5 was reached Protein expression was induced 
by the addition of 0.3 mM Isopropyl 6-D-Thiogalactopyranoside (IPTG) at either 
'15?C for 16 h or at 37°C ' for .2 .h. Crude cell extracts were visualized by 
electrophoresis on a 12% Tris-Glycine gel' (Novex, San Diego, GA) followed by 
staining with' Cooniassie Brilliant Blue. 

Protein Purification for in vitro /ran^-splicing or /raHS-cleavage reactions: 

ER2566 cells containing pMEB2 or pBELl 1 were grown at 37°C tt) an CD 
600 of 0.5. Following IPTG (0.5 noM) induced protein expression overnight at 

' 15°C, cells were harvested by centrifugation at 3,000xg for 30 nninutes. The MBP- 
DnaE(N)-CBD (ME(N)B) fusion protein was purified by amylose as described 

. previously (Evans, et al., Protein 5c/., 7:2256-2264 (1998)). The cell pellet was ' 
resuspended in Buffer A (20 mM Tris-HGl, pH 7.0 containing 500 mM NaCl) and 
lysed by sonication. Aftercentrifugalion at 23,000xg for 30 minutes the 
supernatant was applied to a 15 mi amylose resin (New England Biolabs, Inc., 
Beverly, MA) equilibrated in Buffer A. The resin was washed with 10-15 colunm 
volumes of Buffer A. The fusion protein was eluted with Buffer B (20 mM Tris- 
HCl,.pH 7.0 containing 500 mM NaGl and 10 mM maltose). Protein 
concentrations were determined using the:Bio-Rad Protein Assay (Bio-Rad 
Laboratories, Hercules/CA). 

EXAMPLE IV . 

In vitro trans 'Splicing and/or /raws-Cleavage: 

rran5-splicing and/or /ra/75.-cleavage studies of the Ssp DnaE intein were 
conducted in vitro using the purified ME(N)B and two 40 amino acid peptides, 
synthesized as described previously (Evans, et al., Protein Sci. 7:2256-2264 
(1998)), consisting of the C-terminal 36 amino acids of the Ssp DnaE intein, with 
either an Asn (Splice-pep) or an Ala at residue 36 (Cleav-pep), and the next four 
naturally occurring amino acids (CFNK), The splicing peptide had a biotinylated 
lysine (K*) as the C-terminal residue. The reaction consisted of adding either the 
splicing or cleavage peptide (500 :M final concentration) to ME(N)B (1 mg/ml) in 
reaction buffer (100 mM Tris-HCl, pH 7.0 containing 500 mM NaCl) followed by 
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incubation overnight ai room temperature. The on-column rran^y-splicing used the 
GBD-DnaE(C)-T4 DNA ligase protein absorbed onto a chitin resin in which 
. unbound protein was washed off with 20= column volumes of Buffer A. The 
^ ME(N)B fusion protein (9 :M), either free in solution or pre-bound to chitin beads, 
was then added to the chitin bound CBD-DnaE(C)-T4 DNA ligase (3 :M). The 
reactions were then incubated for 16 hours at the appropriate temperature in Buffer 
A and monitored by SDS-PAGE.. 

EXAMPLE V 

In vitro and in vivo Protein Cyclization using a ^raw^-spiicing intein: 

ER2566 cells bearing pMEB21 were grown, induced, harvested and lysed 
- as described in Example II and Example III. The clarified supernatant from celjs 
induced at 15°C was applied to an amy lose resin (10 mL bed volume) whereas Jhe 
clarified supernatant from cells induced, at 37^G was applied to a chitin resin (1|. 
mL bed volume). Unbound proteins were washed from the resin with 20 column 
volumes of Buffer A. Proteins were eluted from the amylose column with Buffer 
B. The intramolecular rm;25-splicing reaction proceeded in vitro when the chitin 
column was incubated for 20 hours at room temperature. Reaction products were 
eluted from the resin with Buffer A. The cyclic maltose binding protein (MBP):was 
analyzed by treatment with Factor Xa (Fxa) (1:100, FXa:protein mass to mass ratio) 

overnight at 4<^C to generate' linearized MBP. The proteolyzed proteins were 
subjected to amino acid sequencing using a Procise 494 protein sequencer (PE 
Applied Biosystems, Foster City, CA). - 

EXAMPLE Vr 

' The ligation of maltose binding protein and thk chitin binding domain 
using a cis-splicing intein 

A gene fusion was created in which full length Ssp DnaE intein had maltose 
binding protein (MBP, 42 kDa) fused to its N-terminus and the chitin binding 
domain fused to its C-terminus. This fusion protein was expressed from the from 
the ER2566 cells bearing plasmid pMEB8 as described in Figure 1 and in Example 
II and Example III above. A band corresponding to the ligation of maltose binding 
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protein and the chitin binding domain (MBP-CBD) (MB) following induction of 
protein expression demonstrated that the Ssp DnaE intein can splice in m with only 
5 native N-terminal and 3 native C-terminal extein residues (Figure 2A, lane 2). 
The identity of splicing products was confirmed by western blot analysis using 
anti-maltose binding protein (anti-MBP) and anti-chitin binding domain (anti- 
CBD) antibodies and binding to chitin and amylose resins (data not shown). In 
addition to the spliced product, there was significant cleavage of the peptide bond at 
the N-terminus of the Ssp DnaE intein. 

EXAMPLE V!! 

Alteration of ci^-splicing by changing the amino acids adjacent to the 
intein 

The amino acids adjacent to the inteih (the extein amino acid residues) were 
changed by mutation in the c/\y-sphcing constmct (Figure 2B)/ Splicing products 
were detected upon induction of ER2566 £ coli cells bearing plasmids pMEB8, 
pMEB8-N2, or pMEB8-C3 as described in Example 1 above. These results 
demonstrated that splicing could occur with either 2 proximal N-extein residues or 
3 proximal C-extein residues (Figure 1 and Figure 2B, lanes ! & 4). However, 
analysis of protein expression following induction of ER2566 £ co/? bells bearing 
plasmids pMEB8-Gl or pMEB8-C2 as described above in Example I did not detect 
any MBP-CBD fusion protein. This demonstrated that splicing activity could be 
altered by reduction of the C-extein sequence to 1 or 2 native amino acid residues 
(Figure 2B, lanes 2 & 3). 

EXAMPLE VIII 

Temperature dependent ^rans-splicing of the Ssp DnaE intein 

The extent of in vivo trans-splicing varied depending on the temperature of 
the E, coli cell growth during induction of protein expression. E. ca//ER2566 
bearing plasmids pMEB4 and pKEBl were induced to express protein as described 
above in Example II. The temperature at which the E. coli cells were incubated 
following induction was either 37°C or IS^'C. There was an accumulation of Ssp 

DnaE intein precursor protein when protein expression was induced at 37°C and 
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this was processed after further growth overnight at 15*^C (Figure 2A and Figure 
2C). 

EXAMPLE IX 

5 . ■ ' * ' 

The ligation of two ribosomally expressed proteins in vitro using the trans- 
splicing Ssp DnaE intein. 

Maltose binding protein was Hgated to T4 DNA ligase by first purifying 
10 Maltose binding protein fused to the N-terminus of the N-terminal splicing domain 

of the Ssp DnaE intein. The chitin binding domain was present as an in-frame 
fusion to the C-terminus of the Ssp DnaE intein N-terminal splicing domain. This 
fusion protein is expressed from plasmid pMEB8 as described in Example II and is 
abbreviated MB P-DnaE(N)-CBD. T4 DNA ligase was purified separately as a 
15 ^ fusion to the C-terminus of the C-terminal splicing domain of the Ssp DnaE intein. ^ 
The chitin binding domain was present as an in-frame fusion to the N^terminus of ^ 
the Ssp DnaE intein C-terminal splicing domain. This fusion protein is expressed 
from plasmid pBELl 1 and is abbreviated CBD-DnaE(C)-T4 DNA ligase. These - ' 
two proteins were bound to a chitin resin through the chitin binding domain 
20 regions. Efficient in vitro rra/25-splicing'OCCurred between the two bacterially- 

expressed proteins, MBP-DnaE(N)-CBD and CBD-DnaE(C):T4 DNA ligase, 

yielding spliced product, MBP-T4 DNA Ligase (ML), at 4'*C and le^'C but - ' 

significantly less at 3rC (Figure 4B). Little difference in splicing efficiency was 
observed when either chitin bound or free ME(N)B was used to react with the chitin 
25 bound BE(C) ligase. Following Factor Xa (Fxa) proteolysis of the released ligation 

product, amino acid sequencing of the 58 kDa band (expected for T4 DNA ligase) 
yielded NH2-GTLEKFAEYCFNIST-COOH (SEQ ID NO: 10) which corresponds 
to the expected sequence of the splice junction. 

30 ' EXAMPLE X 

In vitro ^rans-cleavage or ^raws-splicing using a synthetic peptide 

The in vitro rra/z5-splicing (Figure 3) and/or /r^n^-cleavage activity of the 
35 Ssp DnaE intein was demonstrated using the bacterial ly-expressed ME(N)B 
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precursor and 2 peptides, Splice^pep and Cleav-pep, that mimic the C-terminal Ssp 
DnaE intein fragment (see Example IV above). Both the Splice-pep and the Cleav- 
pep could activate ME(N)B, . resulting in bands, corresponding to the expected 
spliced and/or cleavage product (Figure 4A). Furthermore, the ME(N)B precursor 
was .stable in the absence of either peptide (Figure 4A, lane 1). The cleavage and 
splicing products, maltose binding protein (MBP) and MBP-CFNK*, respectively, 
are indistinguishable by SDS-PAGE. However, a western blot using anti-biotin 
antibody indicated that splicing was occurring, albeit the extent of reaction could not 
be determined (data not shown). 

EXAMPLE X! 

In vivo Protein Cyclization using the S^p DnaE Intein 

Maltose binding protein was cyclized both in vitro and in vivo using the Ssp 
DnaE intein. The C-terminus of the C-terminal Ssp DnaE intein splicing domain 
was fused to the N-terminus of rtialtose binding protein and the N-terminus of the 
N-terminal Ssp DnaE intein splicing domain was fused to the C-terminus of the 
same maltose binding protein. The N~terminal splicing domain of the Ssp DnaE 
intein also had the chitin binding domain fused in-frame to its C-terminus. This 
fusion protein, abbreviated DriaE(C)-MBP-DnaE(N)-CBD, was expressed in E. 
coli ER2566 cells bearing plasmid pMEB21 as described in Example V above. 
Following induction of the cells bearing plasmid pMEB21 the crude cell lysate was 
analyzed by SDS-PAGE (Figure 5A, lane 2) and western blot analysis and 
demonstrated that cells expressing pMEB21 contained precursor protein, DnaE(C)- 
MBP-DnaE(N)-CBD, linear MBP, circular MBP, DnaE(N)-CBD, and DnaE(C)- 
MBP. The putative linear and cyclic MBP species as well as higher molecular 
weight species (Figure 5 A, lane 4) were found to bind to amy lose and elute with 
maltose. r * 
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EXAMPLEXII 

//rv/rra Protdn Cyclization using the 5^/; DnaE Intein 

In v//ra cyclization was perforrned by isolating the precursor consisting of 
DnaE(C)-MBP-DnaE(N)-CBD which could be obtained by inducing protein 

expression of E. coli ER2566 bearing plasniid pMEB21 for 2 hours at 3TC 
(Figure 5B, lane 2) as described in Example V above. The crude cell lysate was 
applied to a chitin resin to which the precursor bound through the CBD. Unbound 
proteins were washed from the column and the cyclization reaction proceeded 

overnight at 23^C. Fractions from the chitin resin contained cyclic and linear MB? 
species (Figure 5B, lane 4). Factor Xa (Fxa) treatment of the isolated proteins 
(Figure 5B, lane 5) followed by amino acid sequencing confirmed the presence of - 
both the linear and circular forms. , - , - . 

EXAMPLE XIII I 

Amino acid sequencing of the ligation junction of cyclic MBP 

The maltose eluted proteins were^subjected to Factor Xa (Fxa)» proteolysis 
and amino acid sequencing. The upperportipn of the 43 kDa band yielded NH2-G # 
TLEKFAEYXFNIST0M-CQ.OH (SEQ © NO: 11) which matched the sequence 
for the cyclic maltosei>inding protein (MBP) that was linearized with Factor Xa 
(Fxa). Sequencing the lower paijt of the 43 kDa band gave NH2-XFNISTGM- 
COOH (SEQ ID NO: 12) which matched the N-terminus of the linear maltose 
binding . protein (MBP) which had not -undergone cyclization. NH2-XVKIG 
RRSLGV-COOH (SEQ ID NO: 13) was obtained from the 45 kDa band and 
correlates with the expected sequence from the DnaE(C)-MBP product. The X 
designates a sequencing cycle in which no amino acid could be assigned with 
confidence. 
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What is claimed is: 

1. A method for producing a fused polypeptide comprising the steps of: 

(a) immobilizing a first polypeptide or portion thereof fused to a first 
intein fragment on a solid support; 

(b) immobilizing a second polypeptide or portion thereof fused to a 
second intein fragment on a solid support wherein said second 
intein fragment is complementary to said first intein fragment; and 

(c) reacting said first and second immobilized polypeptides under 
conditions which favor transplicing of said first and second 
polypeptide to produce a fused polypeptide comprising said first 
and second polypeptides. 

2. The method of claim 1, further comprising the step of eluting the fused 
polypeptide from the solid support. 

3. . ,The method of claim 1, wherein said first and second polypeptides are 
selected from the group consisting of peptides, proteins and enzymes. 

4. The method of claim 1, wherein said sohd suppon is selected from the 
group consisting of affinity based supports, chips, plates, biochip supports, glass 
wafers and microtiter plates. ; 

5. The method of claim 1 , wherein the first and second intein fragments are 
selected from the group consisting of naturally split inteins or artificially split 
inteins. 

6. The method of claim 5, wherein said first and second intein fragments are 
obtained from an intein selected from the group consisting of the Ssp DnaE intein, 
Ssp DnaB intein, Mm RecA intein, Psp Pol-I intein, Pl-p/zd intein or V\-pfu\l intein. 

7. The method of claim 1, wherein the first and second intein fragment further 
comprised an affinity binding domain. 


WOOI/57183 


PCt/USOl/03147 


-26- 

8. The method of claim 7, wherein the affinity binding domain is selected from 
the group consisting of the chitin binding domain from B. circiilans, the maltose 
binding protein from E, coli, a His tag, cellulose binding protein or a Flag-tag. 

9. The method of claim 1 , wherein said first polypeptide comprises an N- 
terminal intein fragment fused to the C-terminus of said first polypeptide. 

10. The method of claim 1 , wherein said second iritein fragment comprises a C- 
terminal intein fragment fused to the N-terminus of said second polypeptide. 

•11. A method for producing a cyclic polypeptide comprising the steps of: 

(a) fusing the C-terminal portion of a split intein to the N-terminus of a 
target polypeptide and fusing the N-terminal portion of a split intein 
* ' to the C-terminus of said target polypeptide to produce a fused 

polypeptide; 

fusing an affinity binding domain to said fused polypeptide; 

immobilizing the product of step (b) on a solid support; and 

incubating the immobilized precursor under conditions that favor 
fomnation of the cyclic polypeptide. 

12. ** A method for the in v/vo=production of a cyclic polypeptide comprising the 
steps of: ^ ' - 

(a) fusing the C-tentiinal portion of a split intein to the N-terminus of a 
- target polypeptide and fusing the N-terminal portion of a split intein 

to the C-terminus of said target polypeiptide to produce a fused 
polypeptide; and • ' ' ■ 

(b) reacting said fused polypeptide in vivo under conditions favoring the 
formation of a cyclic polypeptide. * 


(b) 
(c) 
- (d)- 
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13. The method of claim 1 1 , further comprising the step of eluting said cyclic 
polypeptide from said solid support. 

14. The method of claims 1 1 or 12, wherein said first and second polypeptides 
are selected from the group consisting of peptides, proteins or enzymes. 

15. The method of claim 1 1, wherein said solid support is selected from the 
group consisting of affinity based supports, chips, plates, biochip suppoitst glass 
wafers or microtiter nlates-- . 

16. The method of claims 11 or 12, wherein said split intein is a naturally split 
intein. ^ . 


17. The.method ofclaims'l I.or 12, wherein said split intein is 'an artificially 
split intein. 

1 8 . The method of claim 1 1 , wherein the affinity binding domain is selected 

. from the group consisting of the chitiri binding domain, the maltose binding protein, 
a His tag, the cellulose binding protein or a Flag-tag. 
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Figure 1 
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Figure 3 
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